Seeder: discriminative seeding DNA motif discovery

نویسندگان

  • François Fauteux
  • Mathieu Blanchette
  • Martina V. Stromvik
چکیده

MOTIVATION The computational identification of transcription factor binding sites is a major challenge in bioinformatics and an important complement to experimental approaches. RESULTS We describe a novel, exact discriminative seeding DNA motif discovery algorithm designed for fast and reliable prediction of cis-regulatory elements in eukaryotic promoters. The algorithm is tested on biological benchmark data and shown to perform equally or better than other motif discovery tools. The algorithm is applied to the analysis of plant tissue-specific promoter sequences and successfully identifies key regulatory elements.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Development of an Efficient Hybrid Method for Motif Discovery in DNA Sequences

This work presents a hybrid method for motif discovery in DNA sequences. The proposed method called SPSO-Lk, borrows the concept of Chebyshev polynomials and uses the stochastic local search to improve the performance of the basic PSO algorithm as a motif finder. The Chebyshev polynomial concept encourages us to use a linear combination of previously discovered velocities beyond that proposed b...

متن کامل

An Approach to Fault Modeling and Fault Seeding Using the Program Dependence Graph1

We present a fault-classification scheme and a fault-seeding method that are based on the manifestation of faults in the program dependence graph (PDG). We enhance the domain/computation faultclassification scheme developed by Howden to further characterize faults as structural and statement-level, depending on the differences between the PDG for the original program and the PDG for the faulty ...

متن کامل

SeqGL Identifies Context-Dependent Binding Signals in Genome-Wide Regulatory Element Maps

Genome-wide maps of transcription factor (TF) occupancy and regions of open chromatin implicitly contain DNA sequence signals for multiple factors. We present SeqGL, a novel de novo motif discovery algorithm to identify multiple TF sequence signals from ChIP-, DNase-, and ATAC-seq profiles. SeqGL trains a discriminative model using a k-mer feature representation together with group lasso regula...

متن کامل

DECOD: fast and accurate discriminative DNA motif finding

MOTIVATION Motif discovery is now routinely used in high-throughput studies including large-scale sequencing and proteomics. These datasets present new challenges. The first is speed. Many motif discovery methods do not scale well to large datasets. Another issue is identifying discriminative rather than generative motifs. Such discriminative motifs are important for identifying co-factors and ...

متن کامل

3-D Turbulence Numerical Simulation for the Flow Field of Suction Cylinder-Seeder with Socket-Slots

The flow field has significantly impact on seeding performance in the suction seeding device. A three-dimensional, incompressible, viscous, RNG turbulence model and the SIMPLE method were used by computational fluid dynamics(CFD), and the flow fields of suction cylinder-seeder with different socket’s radiuses were simulated by Fluent. When vacuum is 4kPa and productivity is 350 trays/h, the sim...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Bioinformatics

دوره 24  شماره 

صفحات  -

تاریخ انتشار 2008